Add voice interactions with Gemini Live and ros-mcp-server to Gemini example. #115

tracelarue · 2025-09-24T15:55:44Z

Gemini Live for low-latency bidirectional voice interactions with ros-mcp-server.

Added gemini_live to examples/2_gemini.
Enables audio input from the user and audio output from Gemini.
Enables Gemini Live to use ros-mcp-server.
Tested in ubuntu 22.04, python 3.10, ros2 humble.

stex2005 · 2025-09-24T16:09:17Z

Thank you for your contribution, @tracelarue. I will give it a try soon.

@rjohn-v — I’d suggest adding a client/ folder in the repository to store installation packages and runnable clients. This would help keep different client implementations (e.g., Gemini API client) and their installation steps organized in one place. I’m not sure I’d keep this under examples/.

stex2005 · 2025-09-24T16:14:58Z

I connected issue robotmcp/robot-mcp-client#20 to this PR.
After, we can close it and reopen for other APIs.

stex2005 · 2025-09-26T00:25:00Z

Raises this error during installation, seems that I need a system-package: portaudio.

I woudl recommend trying to include this into dependencies in the README.md + comamnd to install:

sudo apt install portaudio19-dev

stex2005 · 2025-09-26T00:35:33Z

Couldn't run uv run on my WSL Ubuntu. Please specify that this works only on Ubuntu, will try on Ubuntu soon.

stex2005 · 2025-09-26T00:41:06Z

@tracelarue @rjohn-v Another good next step would be to provide a dockerized version of the Gemini client, so it can be run more easily in different environments. Since the client is only a tool within this project, I don’t think we should invest too much effort in tightly integrating it into the repo. A simplified version of client_gemini (without audio support) would already be a good, lightweight solution and would find a good place in clients folder.

examples/2_gemini/gemini_live/mcp_config.json

examples/2_gemini/gemini_live/pyproject.toml

examples/2_gemini/gemini_live/README.md

examples/2_gemini/gemini_live/mcp_handler.py

stex2005 · 2025-09-26T00:58:53Z

Couldn't run uv run on my WSL Ubuntu. Please specify that this works only on Ubuntu, will try on Ubuntu soon.

This is the same error when I try to run mcp_handler.py

tracelarue · 2025-09-26T20:05:03Z

@stex2005 Thank you for the review and feedback. I'll work on getting these changes and fixes implemented.

mokcontoro · 2025-09-27T14:57:48Z

@tracelarue wow, voice command sounds super cool. thanks for your contributions. I cannot wait for trying this feature soon!

tracelarue · 2025-10-04T19:21:45Z

@stex2005 Ready for review

Changes made:

removed mcp_config.json and instruct user to create their own
removed pyproject.toml and uv.lock. Now uses uv pip install to install dependencies in the existing ros-mcp .venv
updated README.md to specify .env location
renamed gemini_live.py to gemini_client.py
specified it only works on ubuntu in README.md (try WSL again after these changes)
removed mcp_handler.py in favor of Google's latest method to connect to mcp servers
Other gemini_client.py improvements and README.md updates

stex2005 · 2025-10-05T04:29:44Z

@stex2005 Ready for review

Changes made:

removed mcp_config.json and instruct user to create their own

removed pyproject.toml and uv.lock. Now uses uv pip install to install dependencies in the existing ros-mcp .venv

updated README.md to specify .env location

renamed gemini_live.py to gemini_client.py

specified it only works on ubuntu in README.md (try WSL again after these changes)

removed mcp_handler.py in favor of Google's latest method to connect to mcp servers

Other gemini_client.py improvements and README.md updates

@tracelarue thank you for the contribution. I will try it :)

stex2005 · 2025-10-05T05:42:00Z

@tracelarue thanks again for your contribution.

I've tried your example on WSL and Ubuntu and it still gives some compatibility issues due to hardware/driver changes on camera/microphone. I would suggest making this example a simplified version of the gemini_client that works only with text input/output. At this moment the example code is too broad and complex to be maintained.

I am working on this simplified example and I will soon share it.

stex2005 · 2025-10-06T22:20:50Z

@tracelarue thanks again for your contribution.

I've tried your example on WSL and Ubuntu and it still gives some compatibility issues due to hardware/driver changes on camera/microphone. I would suggest making this example a simplified version of the gemini_client that works only with text input/output. At this moment the example code is too broad and complex to be maintained.

I am working on this simplified example and I will soon share it.

@tracelarue Thank you so much again for this contribution. We are debating whether client implementations should be in this or another repository. I would keep this PR on hold until we decide.

Meantime, I would suggest working on the gemini_client lite version, to have it work on both Ubuntu machines and with WSL.

tracelarue · 2025-10-07T14:05:52Z

@stex2005 Thanks for the review. I’ll start working on a text-only version and test it on WSL for compatibility. Could you share your hardware setup and specify where the issue occurred? Was the gemini_client able to launch and connect to the MCP? I’ve previously run the live version on a Raspberry Pi, where I had to adjust the mic and speaker sampling rates for hardware compatibility.

stex2005 · 2025-10-07T20:23:33Z

@stex2005 Thanks for the review. I’ll start working on a text-only version and test it on WSL for compatibility. Could you share your hardware setup and specify where the issue occurred? Was the gemini_client able to launch and connect to the MCP? I’ve previously run the live version on a Raspberry Pi, where I had to adjust the mic and speaker sampling rates for hardware compatibility.

This is the first attempt: https://github.com/stex2005/ros-mcp-client

stex2005 · 2025-10-08T19:04:13Z

We are temporarily closing this PR. Will be implemented in ros-mcp-client

Added Gemini Live with ros-mcp-server example to Gemini example.

b90dc09

tracelarue changed the title ~~Voice interactions with Gemini Live and ros-mcp-server added to Gemini example.~~ Add voice interactions with Gemini Live and ros-mcp-server to Gemini example. Sep 24, 2025

stex2005 requested review from lpigeon, rjohn-v and stex2005 and removed request for lpigeon September 24, 2025 16:05

stex2005 requested changes Sep 26, 2025

View reviewed changes

tracelarue added 11 commits October 1, 2025 22:29

renamed to gemini_client.py, readme updates

2412663

removed mcp_config.json

fcc86f4

removed project inside of repository

a170655

added requirements.txt

31fcd4e

removed mcp_handler.py

f3c1d6e

Updated gemini_client.py

1f8c825

Updated readme

68afc99

ruff format fixes

d3eeb5c

Updated readme to use uv

0e8bc41

Updated gemini_client.py to pull from mcp_config.json.

5b82396

Readme updated af testing

0d49d95

tracelarue requested a review from stex2005 October 4, 2025 19:14

Merge branch 'develop' into Gemini-Live-with-ros-mcp-server

db536b5

Update .gitignore

753a554

stex2005 closed this Oct 8, 2025

Add voice interactions with Gemini Live and ros-mcp-server to Gemini example. #115

Add voice interactions with Gemini Live and ros-mcp-server to Gemini example. #115

Uh oh!

Conversation

tracelarue commented Sep 24, 2025

Uh oh!

stex2005 commented Sep 24, 2025

Uh oh!

stex2005 commented Sep 24, 2025

Uh oh!

stex2005 commented Sep 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stex2005 commented Sep 26, 2025

Uh oh!

stex2005 commented Sep 26, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

stex2005 commented Sep 26, 2025

Uh oh!

tracelarue commented Sep 26, 2025

Uh oh!

mokcontoro commented Sep 27, 2025

Uh oh!

tracelarue commented Oct 4, 2025

Uh oh!

stex2005 commented Oct 5, 2025

Uh oh!

stex2005 commented Oct 5, 2025

Uh oh!

stex2005 commented Oct 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tracelarue commented Oct 7, 2025

Uh oh!

stex2005 commented Oct 7, 2025

Uh oh!

stex2005 commented Oct 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

stex2005 commented Sep 26, 2025 •

edited

Loading

stex2005 commented Oct 6, 2025 •

edited

Loading